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About this guide 


This guide is divided into four sections: 


What is profiling? provides a brief description of the principles and objectives behind statistical 
profiling. 


Statistical Profiler Tutorial takes you in simple steps through setting up the Profiler, profiling 
a simple program, and displaying and interpreting the results. By completing the tutorial, which 
takes about half an hour, you will be familiar with all the Statistical Profiler’s features. 


Appendix A explains the features on the Profilers’s shortcut menu. 
Appendix B gives a detailed description of the format of a Profiler data file (-prf). 


NOTE: The example files referred to in the tutorial are included on the release CD in 
Tutorials\Profiler\Exam1 and ...\Exam2. 


About this guide 


What is profiling? 


Statistical profiling 


The Statistical Profiler included with CodeScape is an analysis tool for examining the run-time 
behavior of programs written for Hitachi SuperH™ microprocessors. It shows where your 
program spends its time so you can identify inefficient sections of code. 


When your program executes, the Profiler takes regular samples of the processor's program 
counter (PC) at a configurable rate from 1Hz to 1kHz. From the address returned with each PC 
sample it can resolve which function your program is in at that moment in time. 


Over a period of execution time, a profile of these samples is built-up which shows the 
approximate percentage of time spent in each function, source line, and instruction of your 
code. These values are used to identify hot spots where your program spends disproportionate 
amounts of time. 


Statistical profiling is passive which means it does not modify your code to gather performance 
data. Therefore your program’s execution time is not noticeably affected when the Profiler is 
running. 


Function trace profiling 


Function trace profiling lets you generate function and trace profiles which provide more detail 
about how specific functions are called. The method employed in CodeScape to do this depends 
on the model of Hitachi SuperH™ microprocessor you are debugging. Contact 

enquiries@ crossproducts.co.uk for more information about function trace profiling with 
CodeScape. 


What is profiling? 


Statistical Profiler Tutorial 


About this tutorial 


This tutorial assumes you are running CodeScape on a supported Hitachi SuperH™ target 
microprocessor. It also assumes that you know how to use CodeScape and are familiar with its 
regular functions and regions. 


The example files used in this tutorial can be found in the CodeScape installation on your 
development PC under Tutorials\Profiler\Exam1 and Exam2. If you did not select the Tutorials 
option during installation, the tutorial files can be copied from the CodeScape CD. 


Profiling example 1 


Loading the example program file 


1. 


pm Co Ob 


Start CodeScape. 
In the Target window, right-click and select Load Program File... 
Browse to the file Tutorials\Profiler\Exam1\profexam.elf. 


Make the settings in the Load Program File dialog box as shown in Figure 1. 
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Figure 1: Load Program File options 


Load Program File 


5. Click OK. 
A progress box is displayed briefly and profexam.elf is downloaded onto the 
target. 


Setting up the Profiler 


1. From the Tools menu select Profiler. 
The Profiler Setup dialog box appears in CodeScape as shown in Figure 2. 


The steps below describe what settings to make in the Profiler Setup dialog box. 
2. Select Statistical Profiling. 


3. In the Sample Period text box, set the sample rate to 1000Hz. 
This is the rate at which the Profiler samples the target’s PC. There is no fixed 
rule to determine the optimum sample rate. The higher the sample rate, and the 
longer the duration of the profile, the more accurate the result will be. Normally 
the maximum sample rate of 1000Hz will not noticeably slow down the 
execution time of your program so we recommend that you always use this 
setting. 


NOTE: On some SH2 and SH3 processors running at slower clock speeds, the sample rate 


may have a greater effect on execution time but it should still be negligible. 
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Figure 2: Profiler Setup dialog 


Profiler Setup Dialog 


In the Hot Spot Threshold box click Hits >= Percent. In the text box enter 10%. 
This specifies the threshold above which functions are labeled with a hot spot in 
the Profiler results window. 

- Hits >= Percent labels all functions that exceed the specified percentage of the 
total number of hits counted during the profile. 

- Top ‘n’ Percent of Functions sorts the functions according to the number of 
times they were hit during the profile, the top ‘n’ percent of which are then 
labeled as hot spots. 

The value entered for this exercise of 10% is a rather crude value because the 
example program has only a few functions. For larger, practical applications you 
would make use of the six decimal places provided to set up lower hot spot 
thresholds with greater resolution. 


Select Show only ‘Hit’ functions after profile. 

With this selected the results of the Profile will show only those functions that 
were hit during the profile. All other functions are not displayed. This is a useful 
feature for profiling large programs. 


Click OK. 


In the Profiler window click - 
regions as shown in Figure 3. 


and a to display Source and Disassembly 
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Figure 3: The Profiler window before performing a profile 
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long main{) @0x0c010020: 
void Initialise() @0x0c010034 

void Buildlist(} @0x0c010050 

void CleanUp() &0x0c010044 

struct LL Node *GetNewNode() @0x0c010098 
struct LL_Node *GetLastNode() @0x0c0100ac 
void InsertWode(struct LL_Wode *, struct L 
struct LL_Node *MallocNode() &0x0c010114 
void FreeHeap() @0x0c010148 
TOTALS 
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Source region. 


int main(void) } 


ox0co10020 ; while(1) The “cl” column 
‘ . 

- shows the cache 

Mo a | line number for 
i 


0x0C010016 


Disassembly ox0c010018 0108 nov.w x0, e= each function. 
Ox0COLOOlLA ooo00 de.w §00 
region. oOxocolooic oo1c mov.b @(rO0,r1), xt 
— OxOCOLOOLE FFOO fedd fr, f 
| main Ao06 bre §0cO10030 | 
Ox0C010022 0009 <s> nop 
0x0C010024 BoOo6 bsr Initialise 
0x0C010028 BOl2 bsr Buildlist 


; 
H 
0x0C010026 0009 <s> nop | 
i 
j 
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Starting the profiler and running the program 


1. Click @ (red) to start the Profiler. 
The button is green when the Profiler is running and red when the Profiler is 
stopped. 


2. Press F9 to run the program. Let the program run for about 30 seconds and press 
F9 again to stop it. 
The Hits counter updates while the program is running. 
The length of time that you profile the program for depends on the size of the 
program you are debugging. The longer you allow the program to run, the 
greater the number of samples taken by the Profiler, and therefore the greater 
the accuracy of the result. 


3. Click (green) to stop the Profiler. 
The Profiler takes a few seconds to resolve the data, after which you see the final 
results in the Profiler window as shown in Figure 4. 
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Figure 4: The Profiler results display 
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4. Click on 4% in the Hits column. 
The cursors in the Source and Disassembly regions automatically move to locate 
the hot spot in the code. The point at which the icon is shown indicates a hot 
spot where the number of hits on that function meets the threshold requirements 
specified earlier. 


NOTE: Sometimes the Profiler cannot resolve the function name from the address returned 
by the PC sample, usually because of insufficient debug data. In this case the hits 
are scored against “Other functions” in the function list. The number hits on “Other 
functions” can include hits from more than one function. 


Analyzing the profile data 


This is a simple program that creates a single-ended linked list of ascending numbers from 1 to 
1000, clears the list, and then starts again. 


The hot spot in the Hits column shows that the program spent 98.5% of the time during the 
profile sample in the function GetLastNode. In this program, spending 98% of the execution 
time in just one function is extremely inefficient. 
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i et 


The Hits column does not show the actual number of hits on each function during the profile, 
instead, statistical profiling gives the number of hits detected by a periodic sample. Therefore 
the number of hits detected during statistical profiling is an indication of the time spent in each 
function rather than the actual number of hits on each function. 


In the function GetLastNode, the program locates the last entry in the list by searching from the 
start of the list until it reaches the last node. The problem occurs in a particular loop in 


GetLastNode, this is highlighted by another three hot spots beneath GetLastNode which are 
shown in the Source region. 


Figure 5: Analyzing the Profiler results 
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void BuildList() @0x0c010050 194 (.643022%) 002: 
void Initialise() @0x0c010034 2 (.006629%) 001: 
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The number in bold next to GetLastNode in the Source region shows the total number of hits 
on that function, and the numbers beneath show how many times each line in that function 
was hit during the profile. 


You can see the hits broken down into even more detail at instruction level in the Disassembly 
region. 


The Profiler’s results show that rewriting GetLastNode so that it does not search from the start 
of the list each time will remove the inefficiency and speed up the program. You will see this 
later in Example 2. 
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Tagging functions and displaying the results using different views 


1. In the function list, double-click on the functions GetLastNode and BuildList. 
The functions are tagged with a blue rectangle. 


(98. 4877%) 
42 (.132877%) 00 
145 (.458744%) 00. 
“182 (6074418) 00) 
95 (.313211%) 00 


(BP struct LL_Node *GetLastWode() &0x0cO100ac 
-f" struct LL_Node *GetWewHode(} G0x0c010098 
f° struct LL_Node smallocwate) Gox0c010114 


oid BuildList() & 


f" void InsertNode(struct LL_Wode *, struct LL 
~ ii TOTALS 


2. Click S@ al 


The button changes to al and the view changes to show only the tagged 
functions. 


St 31130 (98.4877%) 
oid Buildlist() 150 192 (.607441%} 
31608 


3. Click al 


The button changes to a) and the view changes to show only the functions that 
are not tagged. 


aoe Viewing File - C:\Current_projects\Profiler\Exam1\1541580908. pif 


OBMABES = tr 


bumestruct LL Node *GetNeuwNode{) @0x0c010098 
-F struct LL Node *MallocNode() 0x0c010114 


=-F void InsertWode(struct LL_Node *, struct LI| 
>I TOTALS 


42 (.132877%) 
145 (.458744%) 
99 (.3132118}) 


31608 


4. Click ae 


— 


The button changes to 8 sa and the view reverts to show all functions. 
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Where you have large function lists in your profile results: 
+ 
5. Click él to go to the next tagged function in the list. 
ab 
6. Click 2 to sort the functions alphabetically, either ascending or descending. 


7. In the Hits column, click Hits or cl to sort the list either by Hits; ascending or 
descending, or by cache line number; ascending or descending. 


Hits [el | 


Using the search facility 


NOTE: This feature is useful for large function lists. You can search for strings, whole 


words, or parts of words. 


1. Click on GetLastNode in the function list. 


2. Type “bu”. 
The cursor stops at BuildList. 


The search begins from the cursor and looks for exact matches first, then the 
nearest matches in descending order. 


3. To continue the search press F3 or Enter. 


Renaming a function 


1. Click on GetLastNode in the function list. 


2. Right-click, click Rename Function... 
The Rename Function dialog box appears. 


3. In the dialog box type “HotSpot” and click OK. 
The function is renamed HotSpot in the Profiler’s function list. 


NOTE: This does not change the function’s name in the source code. Original source files 


remain unchanged, only the Profiler’s view changes. 
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Saving the profile data 


As profile data 


1. Right-click in the Profiler window, point to File, Save Profile and click Save 
Profile (.PRF)... 


2. In the dialog box browse to the location where you want to save the profile data, 
name the file and click Save. 
You can close the session and return to it later and reload the profile results. Only 
the profile data and Profiler Breakpoints are saved, the Profiler display settings 
and hot spot thresholds are lost. 


NOTE: The format of .PRF files is given in detail in Appendix B. 


As text 


1. Right-click in the Profiler window, point to File, Save Profile and click Save 
Profile (.TX7)... 


2. Fill in the options that you want to save in the text file and click Save. 
The options are described in Table 1. 


Figure 6: Save Profile Text Option 


Profiler Save Text Format Options 
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Table 1: Text format save options 


Saves the profile data for all the functions listed in the Profiler results 


Currently Displayed Saves the profile data for only the functions visible in the function list in the 
Profiler window. 


Tagged Functions Saves the profile data for only the functions that are tagged in the function 
list. 
Saves data associated only with functions. 
Saves data associated only with Trace profiling. 
Overlay Areas Saves additional overlay information if overlays are in use. 
Start Address Saves the start address of each function, this is the address shown in the 
Profiler function list 
End Address Saves the end address of each function, this is not shown in the Profiler 
function list. 


Hits/Cache Line Saves the data in the Hits and Cache Line columns 
Linker Label/Function Name Saves each function’s name and linker label 
Overlay ID Saves each function's overlay ID if overlays are in use. 


Qualified Name Saves the function's qualified name (function name + file name + class 
name) if the function is not unique. 

Address, Hits, Cache Line or Sorts the profile data by Address, Hits, Cache Line or Function Name as 

Function Name selected 


Direction in which the columns are to be sorted in the text file. 
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Profiling example 2 


Ih 
2; 


In the Target window, right-click and select Load Program File... 


Browse to the file Tutorials\Profiler\Exam2\profexam.elf and click OK. Keep the 
Load Program File options as specified for example 1. 
A progress box is displayed briefly and profexam.elf is loaded onto the target. 


Click @ 


Press F9 to run the program. Let the program run for about 30 seconds and press 
F9 again to stop it. 


Click Q| (green) to stop the Profiler. 
The Profiler resolves the data and shows the final results: 


(red) to start the Profiler. 


Figure 7: Profile results for example 2 
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Analyzing the profile data 


In example 2, the program has been rewritten to use a local pointer to remember the last inserted 
node. This means that GetLastNode does not appear in the profile results because the function 
is no longer used and has not been hit during the profile. 


Instead, you can see from the hot spots, that most of the work in this simple program is shared 
fairly evenly between four functions which is a much more efficient use of execution time. 
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Profiling a section of code using Profiler breakpoints 


This feature is used to isolate a section of code for profiling. Due to the nature of statistical 
profiling it is usually preferable to profile the complete program to get a picture of how the 
program performs as a whole, so you would not normally use Profiler breakpoints. 


There are two types of Profiler breakpoints, start and stop, these are different to normal 
breakpoints. When a start breakpoint occurs, an instruction is sent to the Profiler to start 
profiling. The Profiler runs until either a stop breakpoint occurs or until it is stopped manually. 
Unlike other breakpoints, on hitting a Profiler breakpoint, the execution of the code does not 
stop, instead the program continues to run. You can set other breakpoints in CodeScape that 
trigger different actions during profiling. 


e You can only set one Profiler start breakpoint, but you can set multiple Profiler 
stop breakpoints. 


e If the start breakpoint is at the beginning of a loop then you should set at least 
one stop breakpoint at the end of the loop otherwise a recursion can occur in the 
Profiler function call tree. 


e Program execution does not stop when a Profiler breakpoint occurs. 


e — It takes a finite amount of time to process a Profiler breakpoint, therefore a 
frequently hit breakpoint increases the time it takes to profile the program but 
does not effect the processor time (i.e.. the number of clock cycles). 


NOTE: You can set up breakpoints in CodeScape that will trigger the Profiler to start. You 
insert the breakpoints in your program in the normal way using CodeScape Source 
and Disassembly regions and then configure the Trigger Actions dialog box to start 
or stop the Profiler upon hitting the breakpoints. If you use this method, you must 
remember to enable the Profiler before running the program (stop/start button is 
green when the Profiler is enabled) 


Loading the program file 


In the Target region, right-click and load profexam.elf from the Exam2 directory. 
If you already have example 2 loaded from the previous exercise press 
CTRL+ALT+R. This is the default keyboard shortcut to restart the target, reload 
the executable code and reset the Profiler ready for profiling. 


Setting the Profiler breakpoints 


1. In the function list click on main. 
The cursor in the Disassembly region stops on main. 


2. Click on Main in the Disassembly region. 
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3. Press the down arrow key twice until the cursor stops on Initialise. 


4. Press F5 to insert a start breakpoint. 
A green hand appears at the insertion point to indicate a start breakpoint. 


5. Press the down arrow key six times to scroll down to the bra instruction at 


0x0C010030 in 


the Disassembly region. 


6. Press F5 to insert a stop breakpoint. 
A red hand appears at the insertion point to indicate a stop breakpoint. The 
Profiler automatically decides whether the breakpoint is a start or a stop based 
on your previous action. 


Start Profiler 
breakpoint 


Stop Profiler 
breakpoint Oe | 


Figure 8: Profiler start and stop breakpoints 


Disassembly 


All Functions 
F deng main() Soxded1od20 

{" struct LL_Node *GetNewNode() @0x0: 
fF struct LL_Node *MallocNode() @0x0d 
~f void Buildbist() @0x0cO10050 

~f void Initialise() @0x0c010034 


void InsertWode(struct LL Node *, 4 


[ 


Oo0000%) Ot), 


000000%) 004) 
oo0000%) +008 + 


o00000%) 001 | 
ooo000%) 007 | 


Of. 

0 [. 

0 [. i 
0 (.000000%) 002; 
0 [. 

0 [. 

0 


int main(void) 
{ 


6x0C010020 


ox0c010024 
0x0C010028 


6x0C010022 
Ox0C010024 
Ox0C010026 
0x0C010028 
Ox0C010024 
Ox0C01002C 
Ox0CO1002E 


wie fe 
{ i 
Initialise(): i 
BuildList()-: i 
id 
went 
0009 nop lo 
Bo06 bsr Initialyy 
0009 <s> nop 
Bol2 bsr BuildLio 4 
0009 <s> nop 
BOoA bsr CleanUp: 


Ox0C010030 


$0c010G 
ae 


You can see that only one breakpoint is visible in the Source region. This is because the loop 
that you have isolated to profile begins and ends on the same instruction in the source code. 


In some circumstances the stop breakpoint can appear before the start breakpoint in the Source 
region. This is due to the generated executable code being different from the source code as a 
result of compiler optimizations. You can always see exactly where the breakpoints occur by 

looking at the Disassembly region. 


NOTE: 


When setting Profiler breakpoints using the Source region, always check the 


Disassembly region because the Source addresses may not reflect the actual 
executable code. 
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Starting the Profiler and running the program 
1. Click i] (red) to start the Profiler. 


2. Press F9 to run the program. Let the program run for about 30 seconds and press 
F9 again to stop it. 


3. Click (green) to stop the Profiler. 


Figure 9: Results of profiling section of code using breakpoints 


Functions 

@Ox0c010024 < ENTRY POINT > 27 (.034164%) 
struct LL_Node *GetWewNode() @0x/3% 26299 (33.2776%) 004 
struct LL_Node *MallocNode() @0x(% 26299 (33.2776%) 008 
void Buildbist() @0x0cO10050 27 (.034164%) 002 
void CleanUp(} @0x0c010044 26 (.032899%) 002 
void FreeHeap() @0x0c010148 26 (.032899%) 010 


27 (0341648) O01 


void InsertWode{struct LL_Node +, a 26298 (33.2763%) 007 


int main(void) 


{ 
Ox0C010020 while({1) 


{ 
& 2? (.034164%) Ox0C010024 Initialise(}: 
27 (.034164%) — ox0C010028 Buildbist(); 


Be 


& 27 (.034164%) 


Ox0C010024 


Ox0C010026 ooo9 <s> 

27 (.034164%) Ox0C010028 BO1z Buildbist 
ox0colo0za ooos <s> 

26 (.032899%) Oxoco1s02c CleanUp 


0x0C01002E 
& 0x0C010030 
ox0c010032 

27 (.034164%) Initialise 
Ox0C010036 


¢0c010024 


#900, 
alist, 


NOTE: You can set the Profiler to execute only one pass between breakpoints from the 
Profiler shortcut menu. You must do this before enabling the Profiler. 


The Hits column and the hits scores in the Source and Disassembly regions show all the 
functions called from the section of code during the profile and the number of hits on all the 
functions within that section of code. 


In the example program, the linked list of numbers is built up to 1000 and then cleared. This is 
shown by the functions marked with a hot spot that have approximately 1000 more hits than 
the other functions called during the profile. The hot spot functions are called 1000 times for 
each loop of the program while the other functions are called only once. The program was in 
its 27th loop when the Profiler was stopped. 
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Appendix A: Profiler’s 
shortcut menu 


Table 2: The Profiler’s shortcut menu 


Select: To: 


Load or save profile information. Profiles are saved using the extension .prf. 


Trace Tree Profile Display Display a tree showing each function, the functions that called it, and the 
functions it called. Trace profiling only. 
Function Profile Display Display a simple list of the functions in your program in the Profiler window. 


functions, or all untagged functions. 
Enter a new name for a specific function. 
Specify the profile display options. 
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Select: To: 


Optimize... Instruction cache optimization. Trace profiling only. 


Setup... Specify options for statistical profiling or function trace profiling. 
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Appendix B: Profiler File 
Format 


This appendix describes the Profiler File Format. The following tables of data structures that 
define the file format are in the order that they appear in the file. 


The file format consists of six structures: 


i. 


Profiler File ID 
This is the label at the start of the file which initially identifies it as a profiler file. 


File Header 
This is the file's header which contains global information about the profile. 


Overlay Area Data 
This is a description of an overlay area. 


Function Data 
This contains data of a function that was called during the profile. 


Trace Function Call 
This describes a function call which has its own function data unique to that 
particular call path. 


Trace Function Return 
This describes a function return. 


The position of the Trace Function Call/Return structures in the file defines the function's call/ 
return relationship. 


If the file represents a Statistical Profile as opposed to a Trace Profile there will be no Trace 
Function Call or Trace Function Return structures in the file. 


21 


Statistical Profiler Tutorial 


The File Header indicates the type of profile. If it is a Statistical Profile the Number Of Function 
Calls and Trace Information Offset will be set to zero along with all the Counter data. In this 
case only the Function Hits data is valid. If the file represents a Trace Profile all the data is valid. 


Incorrect counter data (see Function Data Structure and Trace Function Call Structure) can 
occur when there is more than one Profile Event (JSR, RTS, Interrupt, Exception, RTE, BSR, 
BSRF) in the SH4's pipeline. This means there is no way of knowing which event incremented 
which counter and by what amount. This is an anomaly of the SH4's tracing hardware and the 
effect on the Profiler's results is usually negligible. 


The Counter Description Table describes all the counters available in he SH4's ASE mode debug 
facility. 


Profiler File ID Structure 


Byte Type Profiler File ID 
fe | Byte 0x45 'E’ 
Byte 0x52 R 
9 Byte 0x46 'F 
10 Byte 0x49 TT’ 
ll Byte Ox4c 'L 
12 Byte 0x45 'E' 


Ox3a 


= 
Ww 
w 
S 
oO 


ws) 
Nh 
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File Header Structure 


Type File Header 


Unsigned Long (LSB) 
Size of Header to follow. (0x00000055) 


(MSB) 


Instruction Cache : 0x01 Enabled (8k) , 0x00 Disabled. 


Unsigned Quadword 


Type of Profile 


0x01 - Statistical Profile - in this mode only the Function Hits 
are valid. All the counter data in every structure will be zero. 


0x02 - Trace Profile - all counters and hits are valid. 


Overall accumulative total number of Function Hits 
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Byte Type File Header 


Unsigned Quadword 


Overall accumulative total of Counter 1 


Unsigned Quadword (LSB) 


Overall accumulative total of Counter 1 minus the functions it 


called. 
(MSB) 

39 Unsigned Quadword (LSB) 
40 
41 
42 Overall accumulative total of Counter 2 
43 
45 
46 (MSB) 

Unsigned Quadword (LSB) 


Overall accumulative total of Counter 2 minus the functions it 
called. 


(MSB) 


55 Byte What Counter | represents, see the Counter Description Table at 
the end of the document. 

56 Byte What Counter 2 represents, see the Counter Description Table at 
the end of the document. 


57 Unsigned Long (LSB) 

58 

59 Number of Overlays 

60 (MSB) 
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Type 


Unsigned Long 
Unsigned Long 


Unsigned Long 


Unsigned Long 
Unsigned Long 
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File Header 
(LSB) 


Number of functions 


(LSB) 
Number of function calls. 


If Byte 14 is set to 0x01 (for Statistical Profile) then this will be 
set to zero. 


(MSB) 


(LSB) 
Overlay Information Offset 


Offset into this file from the End of the Header to the Overlay 
Information. This is only valid if the Number Of Overlays field is 
greater than zero. 


(MSB) 


(LSB) 
Function Information Offset 
Offset into this file from the End of the Header to the Function 
Information. 
(MSB) 


(LSB) 
Trace Information Offset 


Offset into this file from the End of the Header to the Trace 
Information (function calls/returns). If Byte 14 is set to 0x01 
(for Statistical Profile) then this will be set to zero. 


(MSB) 
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Overlay Area Data Structure 


The Overlay Data that follows is a list of Overlays that exist within the program. The data is 
repeated Number Of Overlays times. Number Of Overlays can be found in the File Header. 


Overlay Area Data 


Unsigned Long (LSB) 


Size of Overlay Area Data to follow. 


Unsigned Long (LSB) 


Overlay Area Index ID. 


Unsigned Long (LSB) 


Start Address of Overlay Area. 


Unsigned Long (LSB) 


End Address of Overlay Area. 


Unsigned Long (LSB) 


Current Overlay ID (at the time of saving). 
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Function Data Structure 


The Function Data that follows is a list of functions called during the profile with Accumulative 
Totals for that function regardless of call path. The data is repeated Number Of Functions times. 
Number Of Functions can be found in the File Header. 


Type Function Data 


Unsigned Long (LSB) 
Size Of Function Data to Follow. 


Byte Function Description. 
0x00 - Unknown. 
0x01 - Function. 
0x02 - Interrupt/Exception (unknown VBR + ?). 
0x03 - Exception VBR + 0x100. 
0x04 - Exception VBR + 0x400. 
0x05 - Interrupt VBR + 0x600. 
0x06 - Exception DBR. 
0x07 - User Defined Block. 


Unsigned Word Attributes (Bit Field) 
0x00000001 - This function has possible incorrect counter data. 
0x00000002 - This function was Tagged (Selected) by the user. 
All other bit-fields will be 0. 


Unsigned Word (LSB) 
Cache Line Number. 
0 - 256 (256 lines * 32 Bytes = 8k) , OxFFFF Non Cacheable 
(MSB) 
Unsigned Long (LSB) 
Minimum Program Counter of Function Scope. 
Unsigned Long (LSB) 
Max Program Counter of Function Scope. 
Unsigned Long (LSB) 
Overlay Area Index ID 
ID OxFFFFFFFF = Not part of an overlay. 
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Type 


Unsigned Long 


Unsigned Long 


Unsigned Quadword 


Unsigned Quadword 


Unsigned Quadword 
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Function Data 


(LSB) 
Overlay ID 
ID OxFFFFFFFF = Not an Overlay. 


(LSB) 
Function Group ID 
ID OxFFFFFFFF = No Group. 


Accumulative total number of Function Hits for this function. 


(MSB) 


Accumulative total of Counter | for this function. 


(LSB) 


Accumulative total of Counter 1 minus the functions it called 
for this function. 
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Function Data 


Unsigned Quadword (LSB) 


Accumulative total of Counter 2 for this function. 


Unsigned Quadword (LSB) 


Accumulative total of Counter 2 minus the functions it called 
for this function. 


(LSB) 


The size of the following function name. ('n1’) 


Function Name. 
The string is of the size specified in the previous field. 


(LSB) 
The size of the following Linker Label. ('n2’) 


Linker Label. 
The string is of size specified in the previous field. 
The Linker Label is the label that was used to link this function. 


(LSB) 
The size of the following Qualified Name. 


Qualified Name. 
The string is of size specified in the previous field. 


The Qualified Name is used to specify a function in greater 
detail (filename, class etc.) in the event of a duplicate function 
name. 
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Trace Function Call Structure 


The Function Trace Data (function calls/returns) consists of two data structures Trace Function 
Call and Trace Function Return. To distinguish between them the first byte after the data size 
is OxFF for a function call and 0x00 for a return. There is a file offset to this data that is stored 
in the File Header. Also stored in the File Header is the Number of Function Calls. The number 
of following Trace Function Calls is coupled with an equal amount of Trace Function Returns. 


Byte Type Trace Function Call 
0 Unsigned Long (LSB) 
1 
2 Size of Trace Function Call Data to follow. 
3 
5 Unsigned Long (LSB) 
6 Unique Call Number. 
7 This number is present in the matching 'Return From Function’ 
8 data. 
(MSB) 
9 Byte Function Description. 
0x00 - Unknown. 
0x01 - Function. 
0x02 - Interrupt/Exception (unknown VBR +). 
0x03 - Exception VBR + 0x100. 
0x04 - Exception VBR + 0x400. 
0x05 - Interrupt VBR + 0x600. 
0x06 - Exception DBR. 
0x07 - User Defined Block. 
10 Unsigned Long Attributes (Bit Field) 
ll 0x00000001 - This function has possible incorrect counter data. 
All other bit-fields will be 0. 
12 Unsigned Word (LSB) 
13 Cache Line Number. 
0 - 256 (256 lines * 32 Bytes = 8k), OxFFFF(-1) Non Cacheable 
(MSB) 
14 Unsigned Long (LSB) 
15 
16 Minimum Program Counter of Function Scope. 
17 
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Type Trace Function Call 


Unsigned Long (LSB) 

Max Program Counter of Function Scope. 
Unsigned Long (LSB) 

Overlay Area Index ID 

ID OxFFFFFFFF = Not part of any overlay. 
Unsigned Long (LSB) 

Overlay ID 

ID OxFFFFFFFF = Not an overlay. 
Unsigned Long (LSB) 

Function Group ID 

ID OxFFFFFFFF = No Group. 


Unsigned Quadword (LSB) 


Accumulative total number of Function Hits for this function 


with this call path. 


Unsigned Quadword) (LSB) 


Accumulative total of Counter 1 for this function with this call 
path. 
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Unsigned Quadword 


Unsigned Quadword 


Unsigned Quadword 


Trace Function Call 


(LSB) 


Accumulative total of Counter 1 minus the functions it called 
for this function with this call path. 


(LSB) 


Accumulative Total of Counter 2 for this function with this call 
path. 


(LSB) 


Accumulative Total of Counter 2 minus the functions it called 
for this function with this call path. 


(LSB) 
The Size of the Following Function Name. ('n1') 


Function Name. 
The string is of size specified in the previous field. 


(LSB) 
The Size of the following Linker Label. ('n2’) 


Linker Label. 
The String is of size specified in the previous field. 
The Linker Label is the label that was used to link this function. 


(LSB) 
The Size of the following Qualified Name. 


84 +'nl' +2 
+'n2' +2 
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Trace Function Call 


Qualified Name. 


The string is of size specified in the previous field. 

The Qualified Name is used to specify a function in greater 
detail (filename, class etc.) in the event of a duplicate function 
name. 
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Trace Function Return Structure 


Trace Function Return 
Unsigned Long (LSB) 


Size Of Trace Function Return Data to Follow. (0x00000005) 
(MSB) 


Unsigned Long (LSB) 


Unique Call Number. This number is to ensure that the return 
Unique Number matches the Call Unique Number. 


(MSB) 


34 


CodeScape Statistical Profiler Guide 


Counter Description Table 


Counter Description Number Counter Description Count/Cycles 


Operand access (write/with cache) Count 


Instruction fetch (with cache) - 2 Count 
instructions fetched simultaneously 

Instruction cache miss Count 

All instruction access - 2 instructions Count 
fetched simultaneously 

Operand access (read + write/with cache) Count 


FPU instruction execution Count 


Count 
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Counter Description Number Counter Description Count/Cycles 


0x18 TRAPA instruction execution Count 


UBC-A match Count 


UBC-B match Cycles 


0x21 Instruction cache fill 


0x22 Operand cache fill 
0x23 Elapsed time Cycles 
0x24 


Pipeline freeze (by cache miss/instruction) Cycles 


Pipeline freeze (by cache miss/data) 
0x27 Pipeline freeze (by branch instruction) Cycles 
0x28 Pipeline freeze (by CPU register) Cycles 
0x29 Pipeline freeze (by FPU) 
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